Asymmetric matrices in an analysis of financial correlations 
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Financial markets are highly correlated systems that reveal both the 
inter-market dependencies and the correlations among their different com- 
ponents. Standard analyzing techniques include correlation coefficients for 
pairs of signals and correlation matrices for rich multivariate data. In the 
latter case one constructs a real symmetric matrix with real non-negative 
eigenvalues describing the correlation structure of the data. However, if one 
performs a correlation-function-like analysis of multivariate data, when a 
stress is put on investigation of delayed dependencies among different types 
of signals, one can calculate an asymmetric correlation matrix with complex 
eigenspectrum. From the Random Matrix Theory point of view this kind 
of matrices is closely related to Ginibre Orthogonal Ensemble (GinOE). We 
present an example of practical application of such matrices in correlation 
analyses of empirical data. By introducing the time lag, we are able to 
identify temporal structure of the inter-market correlations. Our results 
show that the American and German stock markets evolve almost simul- 
taneously without a significant time lag so that it is hard to find imprints 
of information transfer between these markets. There is only an extremely 
subtle indication that the German market advances the American one by 
a few seconds. 

PACS numbers: 89.75.-k, 89.75.Da, 89.75.Fb, 89.65. Gh 

1. Introduction 

A number of studies have shown that different financial markets reveal 
hierarchical structure P^EIBIIIIIHIIBIIZIIBI that can be approximated by 
factor and group models (e.g. for the stock market case). At the level 

of financial data, these structures are determined principally by strength of 
correlations in returns of different stocks, currencies or other assets. The 
most popular methods of such an analysis are based on the calculation of 
correlation matrices from multivariate time series of returns. The correla- 
tion matrices can then be diagonalized in order to obtain spectra of their 
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eigenvalues and eigenvectors [111 |IJ El or can serve as a source for the 
construction of minimal spanning trees ^ 1131 El I14j . In the standard ap- 
proach, in which the correlations between all analysed assets are taken into 
consideration, the correlation matrix is by construction symmetric due to 
the correlation coefficient invariance under a swap of signals. This obvi- 
ously leads to a real eigenspectrum of the matrix. Usually properties of 
the empirical correlation matrix are compared with universal predictions 
of the adequate, Wishart ensemble of random matrices and the identified 
deviations are considered as an indication of actual correlations among data. 

In principle, however, there is no restriction imposed on the symmetry 
property of a correlation matrix: it may well be antisymmetric or even com- 
pletely asymmetric, depending on which signals are used in the calculations. 
For example, if there are two separate sets of signals and the correlations 
are calculated only across these two sets, the resulting matrix can no longer 
be symmetric and, consequently, its eigenspectrum can be complex. How- 
ever, there is still a non-zero probability that some of the eigenvalues and 
eigenvectors are real. As long as a distribution of the correlation matrix 
elements is close to a Gaussian, the most relevant random matrix ensem- 
ble, against which the results should be tested, is the Ginibre Orthogonal 
Ensemble (GinOE) [T^]. For the financial data characterized by fat tails 
of p.d.f. this assumption can also be made provided the time series under 
study are sufficiently long. 

At present one observes in literature a growing interest in theoretical re- 
search on properties of real asymmetric and, more generally, non-Hermitean 
random matrices. This interest is motivated by a broadening spectrum of 
applications of such matrices which includes, among others, random net- 
works ^nj) quantum chaos ^7]) quantum chromo dynamics ^3 EI! an d 
brain research [2U]. An issue which we address in this work and which can 
serve as an example of application of the asymmetric correlation matrices 
to empirical data can be related to a globalization of financial markets. We 
investigate the cross-market correlations between returns of stocks traded 
on two large but geographically distant markets: New York Stock Exchange 
and Deutsche Borse. Our objective is to identify the strength of the instan- 
teous as well as the time lagged dependencies between evolution of these 
two markets. 

2. Methods 

We begin with presenting a brief construction scheme of an asymmetric 
correlation matrix and a short description of basic properties of GinOE. Let 
us consider the two disjoint sets X, Y each consisting of N assets and denote 
by {x^ }i=i,...,T and {y® }i=i,„.,T the time series of normalized logarithmic 
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Fig. 1. Probability density function of complex eigenvalues of exemplary GinOE 
random matrix (N = 30) obtained by averaging the spectra over 100000 individual 
matrix realizations. Colors range from black (probability density close to zero) to 
red (highest probability density). 



returns of assets s£X and t £ Y (s, t = 1, ...,N). For each set we construct 
an N x T data matrix M and the correlation matrix Cxy according to 
formula 

Cxy = ^MxMy. (1) 

Each matrix element — 1 < C 8 t < 1 is the Pearson cross-correlation coeffi- 
cient for assets s and t (C s> t ^ Ct,s)- In the next step the correlation matrix 
can be diagonalized by solving the eigenvalue problem 

CxYV fc = A fc v fc , k = l,...,N (2) 

which provides us with a complete spectrum of generally complex eigenval- 
ues Afc and pairs of conjugated eigenvectors v^. The assumption Y = X in 
Eq.Q leads to the standard definition of a symmetric correlation matrix 
Cxx with a real eigenspectrum. 

Properties of the empirical correlation matrix have to be tested against 
a null hypothesis of completely random correlations characteristic for inde- 
pendent signals. Random Matrix Theory (RMT) offers some analytic results 
for a corresponding ensemble of real asymmetric matrices, i.e. the Ginibre 
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Orthogonal Enseble jT5; defined by the Gaussian probability density 

fbinOE(C) = (2vr)- Af2 /2 eX p[-TV(CC T /2)], (3) 

where C stands for N x N real matrix. In the limit of N — > oo the eigenvalue 
spectrum of a GinOE matrix is homogeneous and assumes a regular elliptic 
shape in the complex plane [2lJ 



p(X) 




^Rez^2 _|_ ^ Imz ^2 < j 
(#) 2 + (^) 2 > 1 1 



where a = 1 + 7, 6 = 1 — 7 an d 7 parametrizes a degree of matrix sym- 
metry (7 = 1, 7 = — 1 correspond to, respectively, symmetric matrix with 
all eigenvalues being real and antisymmetric matrix with imaginary eigen- 
values, while 7 = means full asymmetry). In physical situations with 
finite N, these spectra, however, loose their homegenity due to excess of 
real eigenvalues Aft e whose expected number expressed as a fraction of N in 
the N — > 00 limit reads [22] 



lim — „ 



= y/ 2 /{N*) . (4) 



A typical eigenvalue p.d.f. in the complex plane of a random matrix (N = 
30) obtained from 100000 independent matrix realizations is displayed in 
Figure 1. 



3. Results 

Our example of an application of the asymmetric correlation matrix is 
based on high frequency data from NYSE and Deutsche Borse [23, spanning 
the interval 1 Dec 1997 — 31 Dec 1999. We analyze JV = 30 stocks belonging 
to the Dow Jones Industrials group and the same number of stocks consti- 
tuting the main German DAX30 index. We calculate each element of C by 
cross-correlating the time series pairs representing all possible combinations 
of an American and a German stock. We neither consider the correlations 
inside the German market nor inside the American one. In order to inves- 
tigate temporal dependencies between both markets we introduce a time 
lag r and associate it with the German stocks, i.e. we look at {x^ }j=i ... t 

and {j/}2 T }t=i,...,T> where r can assume both positive and negative integer 
values. Thus, r > denotes a retardation of all the signals corresponding 
to German stocks while r < denotes the opposite case. 

Since the two markets under study are separated by a few time zones, 
their activities overlap only for a relatively short period of a trading day 
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Fig. 2. (a) Probability density function of empirical correlation matrix (histogram) 
together with fitted Gaussian distribution (red solid line) for At — 3 s and for 
zero time lag. (b) Spectrum of complex eigenvalues of correlation matrix for the 
same data as in (a). The largest real eigenvalue is pointed by an arrow. Dashed 
circle denotes theoretical eigenvalue spectrum for GinOE multiplied by standard 
deviation of matrix elements. 
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Fig. 3. Eigenvalue spectra in complex plane for empirical correlation matrix calcu- 
lated for different values of time lag: r = 120 s (a) and r = 300 s (b). 



(only the days that were common to both markets are considered). For most 
time it was only 90 minutes a day from 9:30 to 11:00 New York time (15:30 
to 17:00 Frankfurt time) and only after changing the trading hours in the 
German floor starting from 20 Sep 1999 the overlap interval increased to 120 
minutes (from 9:30 to 11:30 in New York and 15:30 to 17:30 in Frankfurt). 
This means that actually we can analyze the time series spanning 47700 
minutes total. A good time resolution should be a crucial aspect of our 
analysis hence we consider only short time scales of returns: from At = 120 
seconds down to At = 3 seconds. Shorter time scales cannot be used due 
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Fig. 4. | Ai (t) I (vertical lines and full circles) and A2 (t) (red solid line) for a few 
different time scales of returns: At = 120 s (a), At — 60 s (b), At — 30 s (c) and 
At = 15 s (d). 



to a fact that transaction times in the TAQ database are stored with only 
1 s resolution. 

First of all let us look at the correlation matrix and its eigenvalues for r = 
(no time lag, synchronous evolution of both markets). Figure 2(a) presents 
p.d.f. of the matrix elements C s t for At = 3 s (histogram) together with 
a fitted Gaussian distribution (red solid line). Except the central part of 
the empirical distribution, where there are excessive small positive elements 
and lacking small negative ones, the Gaussian is well approximated by the 
histogram (the same refers to the other time scales). Thus, the correlation 
matrix can be treated as a sum of an essentially random core matrix 
and a non-random part carrying the actual inter-market correlations. This 
suggests that we can expect the eigenvalue spectrum consisting of an RMT 
bulk and at least one significant non-random eigenvalue responsible for the 
correlations. In fact, exactly this type of spectrum can be seen in Figure 
2(b). All except one eigenvalues are localized inside the RMT prediction for 
a completely asymmetric matrix and the remaining largest one is distant 
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Fig. 5. |Ai(r)| (vertical lines and full circles) and | A2 (t) (red solid line) for the 
shortest time scale At = 3 s. 

and resides on the real axis. By an analogy to a symmetric matrix we are 
justified to associate this eigenvalue with the coupling strength of the two 
markets (the global market factor). Interestingly, there is no other factor 
which can influence the behaviour of some smaller parts of the markets like 
e.g. specific economic sectors. 

Figure 3 shows examples of the eigenspectra for two different positive 
time lags. As we increase r from s up to 5 min, we observe a gradual 
decrease of |Ai| which remains real even for r > 120 s, but eventually looses 
its identity by drowning in the sea of random eigenvalues for At = 5 min. 
From the market perspective, after such a time interval the stocks traded in 
Frankfurt forget about what happened earlier in New York. We however still 
cannot say anything decisive about the possible directional information flow 
between the markets. It requires a more systematic investigation in which 
the largest eigenvalue Ai (i.e. the one with the largest absolute magnitude) 
becomes a function of variable r. Figure 4 displays Ai(r) for different time 
scales of the returns. It can be seen that with the resolutions of At = 120 
and 60 s the maximum coupling between the markets occurs for synchronous 
signals and the non-random correlations exist for — 3 < r < 3 minutes. For 
At = 30 s a weak trace of asymmetry in both the maximum position and 
the memory length can be identified, which is confirmed in the plot for 
At = 15 s. Going down to the shortest time scale of 3 s, this asymmetry 
becomes clear. Figure 5 documents that the stocks from both markets are 
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maximally correlated if the American market is retarded by about 3-15 
seconds in respect to its German counterpart. This observation is somehow 
counterintuitive because one might expect that the American stock market, 
being the largest in the world and representing the world's largest economy, 
is less dependent on external influence than is the German market. We 
cannot give a straightforward explanation of this phenomenon, though. Its 
source can lie in memory properties of the American market as well as in 
some specific behaviour of investors in the beginning of a trading day in 
New York. For example, they may carefully observe the evolution of the 
European markets which in the years 1998-99 used to finish their activity 
rather soon after the American markets had been opened. We also cannot 
exclude the possibility that the reason for this is a possible existence of 
artifacts in the trade recordings in TAQ or KKMDB databases which cannot 
be identified in data. Finally, the observed asymmetry of the curve tails in 
Figures 4 and 5 with respect to r = can be explained, at least in part, by 
different autocorrelation properties of the two markets under study. This is 
evident in Figure 6, where the largest eigenvalue of the symmetric matrix 
Cxx is calculated separately for the German and for the American markets. 
Here r assumes only non-negative values due to a symmetry of the problem; 
Ai(r) is a multivariate counterpart of the autocorrelation function. It is 
clear from Figure 6 that the German market has considerably longer and 
stronger memory than its American counterpart; in fact, this memory can 
lead to longer-lasting cross-dependencies presented in Figures 4 and 5 if 
the German market is retarded. On the other hand, investors in Frankfurt 
may need a longer time to collect all the information needed before they 
make investment decisions if they take more markets and more information 
into consideration. It is also possible that the American stock market is 
technically more advanced and, on average, allows the investors to react 
quicker than in Germany. 



4. Conclusions 

We construct an asymmetric real correlation matrix from time series of 
returns representing two separate groups of stocks: German and American 
ones. Nonexistence of a symmetry condition allows us to concentrate solely 
on the inter-market correlations without mixing them with the correlations 
that are inner to only one market, and to study temporal properties of such 
correlations. We introduce a time lag associated with German stocks and 
investigate traces of direct information transfer from one market to the other 
which can manifest itself in the existence of significant non-synchronous 
couplings between the markets represented by a r-shifted maximum in the 
largest eigenvalue of the empirical correlation matrix. We identified such 
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Fig. 6. |Ai(t)| for symmetric correlation matrix Cxx calculated for the Ameri- 
can (black circles) and the German (blue squares) stocks separately. Longer and 
stronger memory in the latter case is visible. 

delayed correlations indicating that the same information is shared by both 
markets with the American one following its German counterpart only after 
a few seconds. This observation, however, cannot be treated as a fully 
convincing one due to a significant broadening of the Ai(t) maximum and 
an unintuitive direction of this transfer from a smaller towards a larger 
market. Another conclusion from our results is that the coupling between 
the two analyzed markets is only of a one-factor type. We do not noticed 
other, more subtle partial couplings that can involve a subset of stocks. 

Our results can be compared with the results of ref. j^H] hi which an 
analysis of the delayed correlations between different stocks traded on the 
American market are studied by means of the correlation coefficients. It is 
worth mentioning that a similar analysis can also be performed by applying 
the asymmetric correlation matrices used in our work. 
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